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We study a general set of models of social network evolution and dynamics. The models consist of both a 
dynamics on the network and evolution of the network. Links are formed preferentially between 'similar' nodes, 
where the similarity is defined by the particular process taking place on the network. The interplay between the 
two processes produces phase transitions and hysteresis, as seen using numerical simulations for three specific 
processes. We obtain analytic results using mean field approximations, and for a particular case we derive an 
exact solution for the network. In common with real-world social networks, we find coexistence of high and 
low connectivity phases and history dependence. 
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I. INTRODUCTION 



In recent years physicists have paid much attention to net- 
work structures - describing either technological infrastruc- 
tures or biological, genetic, logical or social relationships - as 
they play a prominent role in shaping the nature of the pro- 
cesses taking place on them and the resulting collective be- 
havior Examples of how the structure affects the function of 
networked systems include the importance of shortcuts in en- 
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dowing finite dimensional networks of the small world prop- 
erty Ql] and of scale-free degree distribution for robustness 
against failure H or the relevance of motifs for specific dy- 
namical properties |31- 

Socio-economic networks offer an example where the rela- 
tion between structure and function is not unidirectional. In- 
deed their structure is inherently dynamical and it is shaped 
by the incentives of agents, i.e. by the socio-economic func- 
tions provided by the network. This paper discusses a class of 
generic model of stochastic dynamical social networks which 
make the interplay between structure and function of social 
network explicit in a simple way. We consider a set of agents 
- be they individuals or organisations - who establish bilat- 
eral interactions (links) when profitable. The network evolves 
under changing conditions. That is, the favourable circum- 
stances that led at some point to the formation of a particu- 
lar link may later on deteriorate, causing that link's removal. 
Hence volatility (exogenous or endogenous) is a key disrup- 
tive element in the dynamics. Concurrently, new opportunities 
arise that favour the formation of new links. Whether linking 
occurs depends on factors related to the similarity or prox- 
imity of the two parties. For example, in cases where trust 
is essential in the establishment of new relationships (e.g. in 
crime or trade networks), linking may be facilitated by com- 
mon acquaintances or by the existence of a chain of acquain- 
tances joining the two parties. In other cases (e.g. in R&D 
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or scientific networks), a common language, methodology, or 
comparable level of technical competence may be required for 
the link to be feasible or fruitful to both parties. 

In a nutshell, our model conceives the dynamics of the net- 
work as a struggle between volatility (that causes link decay) 
on the one hand, and the creation of new links (that is depen- 
dent on similarity) on the other The model must also specify 
the dynamics governing inter-node similarity. A reasonable 
assumption in this respect is that such similarity is enhanced 
by close interaction, as reflected by the social network. For 
example, a firm (or researcher) benefits from collaborating 
with a similarly advanced partner, or individuals who inter- 
act regularly tend to converge on their social norms and other 
standards of behavior 

We study different specifications of the general framework, 
each one embodying alternative forms of the intuitive idea that 
"interaction promotes similarity." Our main finding is that in 
all of these different cases the network dynamics exhibits a 
rich phenomenology characterized by a) sharp phase transi- 
tion b) resilience, i.e. stability against deteriorating condi- 
tions and c) equilibrium coexistence. The essential mecha- 
nism at work is a positive feedback between link creation and 
inter-node similarity, these two factors each exerting a positive 
effect on the other Feedback forces of this kind appear to op- 
erate in the dynamics of many social networks. We show that 
they are sufficient to produce the sharp transitions, resilience, 
and equilibrium co-existence that, as we will discuss in the 
next section, are salient features of many socio-economic phe- 
nomena. Finally, this phenomenology bears a formal similar- 
ity with the liquid-gas phase transition, thus suggesting that 
a classification in terms of phases may be applicable also to 
socio-economic networks. 

The rest of this paper is organized as follows: The next sec- 
tion discusses in an introductory way the empirical evidence 
which our model addresses. In section |lll] we outline the gen- 
eral setup and a generic model of which we will discuss par- 
ticular realizations the following sections. In particular, we 
shall first discuss the case where network formation depends 
on the topology of the network (sect. IIV> and then cases where 
it is coupled with the dynamics of a continuous (sect. |V) or 
discrete (sect. IV II I variable. These two models addresses sit- 



uations where homogeneity in some dimension (e.g. techno- 
logical levels or knowledge) or coordination (on e.g. a stan- 
dard) play a crucial role, respectively. Numerical simulations 
will be supplemented by mean field analysis which provides 
a correct qualitative picture in all cases and, in some cases, 
accurate quantitative estimates. A case where an exact solu- 
tion can be derived will be described in section lVTII In section 
IVlIll we end with some concluding remarks. 



II. EMPIRICAL STYLIZED E\CTS OF 
SOCIO-ECONOMIC NETWORKS 

There is a growing consensus among social scientists that 
many social phenomena display an inherent network dimen- 
sion. Not only are they "embedded" in the underlying so- 
cial network 0] but, reciprocally, the social network itself is 
largely shaped by the evolution of those phenomena. The 
range of social problems subject to these considerations is 
wide and important. It includes, for example, the spread of 
crime |i5i |6J and other social problems (e.g. teenage preg- 
nancy 0131)' the rise of industrial districts |9, 10, Ijl], and 
the establishment of research collaborations, both scientific 
lll2i ri3l and industrial 1 14, 15]. Throughout these cases, there 
are a number of interesting observations worth highlighting: 

(a) Sharp transitions: The shift from a sparse to a highly 
connected network often unfolds rather "abruptly," i.e. in a 
short timespan. For example, concerning the escalation of so- 
cial pathologies in some neighborhoods of large cities. Crane 
[7] writes that "...if the incidence [of the problem] reaches a 
critical point, the process of spread will explode." Also, con- 
sidering the growth of research collaboration networks, Goyal 
et al. ] 13] report a steep increase in the per capita number of 
collaborations among academic economists in the last three 
decades, while Hagerdoorn ] 14] reports an even sharper (ten- 
fold) increase for R&D partnerships among firms during the 
decade 1975-1985. 

(b) Resilience: Once the transition to a highly connected 
network has taken place, the network is robust, surviving 
even a reversion to "unfavorable" conditions. The case of 
California's Silicon Valley, discussed in a classic account by 
Saxenian ] 10], illustrates this point well. Its thriving per- 
formance, even in the face of the general crisis undergone 
by the computer industry in the 80's, has been largely at- 
tributed to the dense and flexible networks of collaboration 
across individual actors that characterized it. Another intrinsi- 
cally network-based example is the rapid recent development 
of Open-Source software (e.g. Linux), a phenomenon sus- 
tained against large odds by a dense web of collaboration and 
trust ] 16]. Finally, as an example where "robustness" has neg- 
ative rather than positive implications. Crane ] 7] describes the 
difficulty, even with vigorous social measures, of improving 
a local neighborhood once crime and other social pathologies 
have taken hold. 

(c) EquUibrium co-existence: Under apparently simi- 
lar environmental conditions, social networks may be found 
both in a dense or sparse state. Again, a good illustration 



is provided by the dual experience of poor neighborhoods 
in large cities 0], where neither poverty nor other socio- 
economic conditions (e.g. ethnic composition) can alone ex- 
plain whether or not there is degradation into a ghetto with 
rampant social problems. Returning to R&D partnerships, 
empirical evidence I14ll shows a very polarized situation, 
almost all R&D partnerships taking place in a few (high- 
technology) industries. Even within those industries, partner- 
ships are almost exclusively between a small subset of firms 
in (highly advanced) countries ] 17]. 

From a theoretical viewpoint, the above discussion raises 
the question of whether there is some common mechanism at 
work in the dynamics of social networks that, in a wide variety 
of different scenarios, produces the three features explained 
above: (a) discontinuous phase transitions, (b) resilience, and 
(c) equilibrium coexistence. Our aim in this paper is to shed 
light on this question within a general framework that is flexi- 
ble enough to accommodate, under alternative concrete spec- 
ifications, a rich range of social-network dynamics. 



III. THE MODEL 

Consider a set A/" = {1, ... ,71} of agents, whose state 
and interactions evolve in continuous time t. They form the 
nodes of a network which is described by a non-directed graph 
g{t) C {ij : i e Af, j e A/"}, where ij{= ji) e g{t) iff a 
link exists between agents i and j. The network evolution is 
modelled in terms of continuous time stochastic elementary 
Poisson processes, and it is therefore defined by specifying 
the rates at which these processes occur ll^ . Firstly, each 
node i receives an opportunity to form a link with a node j, 
randomly drawn from TV (i 7^ j), at rate rj. If the link ij is not 
already in place, it forms with probability 



P{ij ^ 9{t)} = 



1 ifdy(i)<4 
e if dij{t) > d 



(1) 



where dij {t) is the "distance" (to be specified later) between i 
and j prevailing at t. Thus if i and j are close, in the sense that 
their distance is no higher than some given threshold d, the 
link forms at rate r/; otherwise, it forms at a much smaller rate 
rjE. Secondly, each existing link ij e g{t) decays at rate A. 
That is, each link in the network disappears with probability 
Xdt in a time interval [t, t + dt). We shall discuss three differ- 
ent specifications of the distance dij, each capturing different 
aspects that may be relevant for socio-economic interactions. 
In all three cases, we mostly focus on the stationary state 
behavior, which we shall illustrate using both numerical sim- 
ulations and a mean-field analytic approach. Concerning the 
latter, we focus on the limit n ^ 00, for which the analy- 
sis is simpler We characterise the long run behavior of the 
network solely in terms of the stationary degree distribution 
p{k), which is the fraction of agents with k neighbours. This 
corresponds to neglecting degree correlations, i.e. to approx- 
imating the network with a random graph (see (T^), an ap- 
proximation which is reasonably accurate in the cases we dis- 
cuss here. The degree distribution satisfies a master equation. 



which is specified in terms of the transition rates 'w{k -^ fc±l) 
for the addition or removal of a link, for an agent linked with 
k neighbours. While w{k -^ fc — 1) = Afc always takes the 
same form, the transition rate for the addition of a new link 

w{k^ k + l) = 2?7 [e + (1 - e)P{d,j < d}] 

depends on the particular specification of the distance dij. 
Matching the link creation and removal processes, yields an 
equation for the degree distribution p{k). The probability 
P{di,j < d}, in its turn, will itself depend on the network 
density, i.e. on p{k). Our approach will then have the flavor 
of a self-consistent mean field approximation. 



IV. SIMILARITY BY (CHAINS OF) ACQUAINTANCES 

Consider first the simplest possible such specification 
where dij{t) is the (geodesic) distance between i and j on 
the graph g{t), neighbours j of i having dij{t) = 1, neigh- 
bours of the neighbours of i (which are not neighbours of i) 
having dij (t) = 2, and so on. If no path joins i and j we set 
dij{t) = oo. 

This specific model describes a situation where the forma- 
tion of new links is strongly influenced by proximity on the 
graph. It is a simple manifestation of our general idea that 
close interaction brings about similarity - here the two met- 
rics coincide. We set d > n — 1, the link formation process 
then discriminates between agents belonging to the same net- 
work component (which are joined by at least one path of links 
in g) and agents in different components. Distinct compo- 
nents of the graph may, for example, represent different social 
groups. Then Eq. Q captures the fact that belonging to the 
same social group is important in the creation of new links 
(say, because it facilitates control or reciprocity r 20ll2lll '). 

Consider first what happens when rj/X is small. Let (k) be 
the average connectivity (number of links per node) in the net- 
work. The average rate nX (fc) /2 of link removal is very high 
when (fc) is significant. Consequently, we expect to have a 
very low (k), which in turn implies that the population should 
be fragmented into many small groups. Under these circum- 
stances, the likelihood that an agent i "meets" an agent j in 
the same component is negligible for large populations, and 
therefore new links are created at a rate equal to nrje. By bal- 
ancing link creation and link destruction, the average number 
of neighbours of an agent is (fc) = 2r/e/A, as is indeed found 
in our simulations (Fig[3. 

As rj/X increases, the network density (k) increases gradu- 
ally. Then, at a critical value (?//A)i = l/2e-when (fc) = 1- 
a giant component forms. The system makes a discontinuous 
jump (Fig. to a state containing a large and densely inter- 
connected community covering a finite fraction of the popula- 
tion. If rj/X decreases back again beyond the transition point 
(ry/A)i, the dense network remains stable. The dense network 
dissolves back into a sparsely connected one only at a sec- 
ond point {r]/X)2- This phenomenology characterises a wide 
region of parameter space (see inset of Fig. Q and is qualita- 
tively well reproduced by a simple mean field approach. 




FIG. 1 : Mean degree {k) as a function of A (77 has been set to 1) for 
e = 0.2 when dij is the distance on the graph and d > n — 1. The 
results of a mean field theory for n = 00 (solid line) are compared to 
numerical simulations ( x ) starting from both low and high connected 
states with n = 20000. The dashed line corresponds to an unstable 
solution of the mean field equations which separates the basins of 
stability of the two solutions. For finite n the low density state "flips" 
to the high density state when a random fluctuation in (fc) brings 
the system across the stability boundary (i.e. when a sizable giant 
component forms). These fluctuations become more and more rare as 
n increases. Inset : Phase diagram in mean field theory. Coexistence 
occurs in the shaded region whereas below (above) only the dense 
(sparse) network phase is stable. Numerical simulations (symbols) 
agree qualitatively with the mean field prediction. The high (low) 
density state is stable up (down) to the points marked with x (o) and 
is unstable at points marked with o (+). The behavior of (k) along 
the dashed line is reported in the main figure. 



It is worth mentioning that a similar phenomenology occurs 
when d = 2, i.e. when Unks are preferentially formed with 
"friends of friends "\2Z\. In this case, however, the probability 
that two arbitrary nodes i and j have dij = 2 is of order 1/n 
in a network with finite degree. Hence for finite e and A non- 
linear effect manifest only for networks of finite sizes f2Z]. 

We finally mention that the model with d = 2 is reminis- 
cent of a model that was recently proposed |23] to describe 
a situation where (as e.g. in job search |24]) agents find new 
linking opportunities through current partners. In |23] agents 
use their links to search for new connections, whereas here ex- 
isting links favour new link formation. In spite of this concep- 
tual difference, the model in Ref. 12311 also features the phe- 
nomenology (a)-(c) above, i.e. sharp transitions, resilience, 
and phase coexistence. 



A. Mean Field Analysis 

The transition rate for the addition of a new link is w{k -^ 
k + 1) = 2rje if the two agents are in different components 
and w{k -^ fc + 1) = 2r/ if they are in the same component, 
where the factor 2 comes because each node can either initiate 
or receive a new link. In the large n limit the latter case only 



occurs with some probability if the graph has a giant compo- 
nent Q which contains a finite fraction 7 of nodes. For random 
graphs (see Ref. I19ll for details) the fraction of nodes in Q is 
given by 



where 



7=1- (/)(w) 



is)^j2p(''>' 



(2) 



(3) 



is the generating function and u is the probability that a link, 
followed in one direction, does not lead to the giant compo- 
nent. The latter satisfies the equation 



u = 0'H/0'(i). 



(4) 



Hence u'^ is the probability an agent with k neighbours has 
no links connecting him to the giant component, and hence 
is itself not part of the giant component. Then the rate of 
addition of links takes the form 



w{k^k + l) = 2r][€ + (1 - e)7(l - u% 



(5) 



The stationary state condition of the master equation leads to 
the following equation for (j){s) 

X4>'{s) = 277[e + (1 - e)j]<j>{s) - 2r]{l - e)70(ws) (6) 

which can be solved numerically to the desired accuracy. No- 
tice that Eq. (|6} is a self-consistent problem, because the pa- 
rameters 7 and u depend on the solution (pis). The solution 
of this equation is summarised in Fig. [Q Either one or three 
solutions are found, depending on the parameters. In the latter 
case, the intermediate solution is unstable (dashed line in Fig. 
[0, and it separates the basins of attraction of the two stable 
solutions within the present mean field theory. 

The solution is exact where there is no giant component, 
and numerical simulations show that the mean field approach 
is very accurate away from the phase transition from the con- 
nected to the disconnected state. Near the transition to the 
disconnected state our approximation, that an agent's degree 
fully specifies its state, breaks down. This causes the theory 
to overestimate the size of the coexistence region. 



V. SIMILARITY OF KNOWLEDGE/TECHNOLOGY 
LEVELS 

Next, we consider a setup where dij reflects proximity of 
nodes i, j in terms of some continuous (non-negative) real 
attributes, hi{t), hj{t). This case has been dealt with in Ref. 
ll25i1 . which provides a detailed socio-economic motivation for 
the model. In short, the attribute hi could represent the level of 
technical expertise of two firms involved in an R&D partner- 
ship, or the competence of two researchers involved in a joint 
project. It could also be a measure of income or wealth that 
bears on the quality and prospects of a bilateral relationship. 
We assume that each agent i receives an attribute update (or 



upgrade) possibility at a rate v. We focus on the case where 
the dynamics of hi is much faster than that of the network 
(h' ^ A, 77). In the opposite limit, links exits for too short a 
time span to have any correlated effect on the dynamics of hi. 
If agent i receives an update opportunity at time t, we posit 
that 



h,it+)^D{h,, jeK{t)} + mit) 



(7) 



where rii{t) is a random term capturing the idiosyncratic 
change of expertise due to i's own (say research) efforts. In 
Eq. Q the function D{-} captures some process of diffu- 
sion (e.g. sharing of knowledge) in the current neighborhood 
M(t) = {j ■■ ij e g{t)} of agent i. 

We will take rii{t) to be Gaussian i.i.d. random variables 
with zero mean and variance A. This random idiosyncratic 
term competes with the homogenising force of diffusion de- 
scribed by the first term in Eq. 0. Concerning this term, we 
will consider two alternative models. 



A. Best-Practise imitation 

The first one, which we will label Best-Practise Imitation 
(BPI) has the revising player achieve a knowledge level equal 
to the maximum available in her neighbourhood. We have in 
mind a situation where individuals aim at improving in the 
direction of increasing hi and they may do this by some on- 
site effort and also by learning from other individuals. 

Formally, this is captured by the following definition: 

D{hj, j e Mi{t) U {z}} = max hj(t). (8) 

Notice that if i has no neighbor {Mi{t) = 0) then D = hi. 
This is equivalent to a directed polymer at zero temperature 
on the (dynamic) network g{t) |26]. A related model, using 
the idea of best practice imitation but with different noise has 
been studied in 1281 , but for randomly chosen neighbours at 
each interaction (i.e. no network). 

Fig. |2]reports typical results of simulations of this model. 
As in the two previous models, we find a discontinuous transi- 
tion between a sparse and a dense network state, characterised 
by hysteresis effects. When the network is sparse, diffusion is 
ineffective in homogenising growth. Hence the distance dy 
is typically beyond the threshold d, thus the link formation 
process is slow. On the other hand, with a dense network, 
diffusion keeps the gaps between the /i^s of different nodes 
small, which in turn has a positive effect on network forma- 
tion. As before, the phase transition and hysteresis is a result 
of the positive feedback that exists between the dynamics of 
the hi and the adjustment of the network. In the stationary 
state we find that h{t) = {hi{t)) grows linearly in time, i.e. 
hi{t) ~ vt. Notably, the growth process is much faster (i.e. v 
is much higher) in the dense network equilibrium than in the 
sparse one, as shown in the lower panel of Fig. |2 

This model exhibits an interplay between the process on the 
network - the hi{t) which depends on the network - and the 




FIG. 2: Mean degree (k) (top) and growth rate v (bottom) as a 
function of rj, found from numerical simulations of the model with 
Eq. Jsj. Shown are simulations with n — 500 (plusses) and 1000 
(crosses). Arrows denote the approximate point at which the system 
jumps from one phase to the other (this point can be dependent on 
n). Here e = 0.001, noise strength A = 0.1, similarity threshold 
d = 2. The system was run up to i = 1000 for equilibration, then 
from i = 1000 to t = 1100 for data taking. 



network evolution which depends on the hi{t). It is this inter- 
dependence and the coiTesponding positive feedback which 
produces the discontinuous transition and phase coexistence. 
The similarity of the behavior of this model with that of the 
previous section can be understood by analyzing a particular 
limit. Consider indeed the case where rji = 1 with probability 
a and r/i ~ otherwise. When va <^ r/ innovations take place 
at a rate much smaller that that over which new links form. In 
the limit where the dynamics of hi is fast enough {ly ;2> rj), we 
can assume that each new innovation (i.e. each event r/i = 1) 
taking place on a connected component instantaneously prop- 
agates to the entire set of connected nodes. Hence, if d < 1, 
link creation will occur with probability one on nodes in the 
same connected component, whereas nodes in different com- 
ponents will likely have distinct values of hi, so that links will 
form at rate rje. Note also that, in this particular limit, the 
growth rate v is proportional to the size of the largest con- 
nected component. 
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B. Conforming to neighbors 



FIG. 3: Mean degree (k) (top) and probability that two randomly 
chosen nodes are within d of each other, n (bottom), as a function 
of rj. Shown are simulations with n = 200 (plusses) and n — 500 
(crosses). Also theory for the high connected phase (dashed line). 
For large ry, the data points converge towards the theory curve as n 
increases. Arrows denote the approximate point at which the system 
jumps from one phase to the other. Here e — 0.001, noise strength 
A = 1, similarity threshold d = 2. The system was run up to 
t = 1000 for equilibration, then from t = 1000 to t = 1100 for data 
taking. 



technological nature) and agents will naturally tend to adjust 
towards their neighbours' levels. In these cases, interaction 
promotes conformity and conformism constraints the creation 
of new links. At any rate, this specification of the model al- 
lows us to understand how the results of the previous section 
depend on the directionality of the diffusion process. 

Fig. |3]shows that this model exhibits the same generic phe- 
nomenology of a sharp transition and the coexistence of sparse 
and dense network phases. The key consideration, in this case, 
is that when the link density is high, the distribution of hi in 
the population is narrow and hence link creation proceeds at a 
relatively fast rate. 

This intuition is captured by a simple mean field approach. 
We will assume that the network can be well approximated by 
an Erdos-Renyi random graph with average degree (k). When 
1/ ^ 7], A, we can assume that the distribution of hi adjusts 
adiabatically to the changing network. In this limit the dy- 
namics is well described by the Edwards-Wilkinson Langevin 
equation ll30ll . 



The second alternative considered has diffusion embody a 
uniform merging of the neighbourhood's levels, formalised as 
follows 



D{h,} 



YjjeUi{t) ^j 



WTl 






(9) 



where |A/i(i)| is the number of agents in i's neighborhood. 
This second formulation can be conceived as reflecting a pro- 
cess of opinion exchange (with no idea of relative "advance" 
in the levels displayed by different individuals) |!27 t 12911 . Al- 
ternatively, it could be viewed as reflecting a context where 
interaction payoffs are enhanced by compatibility (say, of a 






\m 



This can be seen by considering a small time interval dt. If 
vdt 3> 1, the number of updates on each site is large and can 
be approximated with the central limit theorem with the two 
terms in Eq. ( I10> . In this equation, C,i{t) is a white noise term 
with zero average and {Ci{t)Cj{t')) = i'^ASij6{t — t') and 
we have introduced the (normalized) Laplacian matrix of the 
graph C. The dynamics of this model is easily integrated in 
the normal modes of the diffusion operator In other words, let 
v'^ be the eigenvectors of C, i.e. £u^ = fivf^, then the normal 



modes h^^ = J^i "f^i^i satisfy 



h^' = -v^ih^" ~ C 



(11) 



where, in view of the orthogonaHty of the transformation 
I -^ /i, C^ is again a white noise with the same statistical 
properties of Q. The fluctuations of h^ in the stationary state 
are ((/i^ — (/i^))^) == ■^. Back transforming to the variables 
hi one finds that 



where p(/^) is the density of eigenvalues of the Laplacian ma- 
trix, which has been computed in the limit n ^ oo [31]. No- 
tice that we disregard finite size clusters, which contribute to 
a /i = peak in the spectrum, assuming that the hi value of 
these nodes is broadly distributed so that dij > d whenever i 
or i are not in the giant component. There is no simple closed 
form for p(/x), so one has to resort to numerical calculation. 
To our level of approximation, it is sufficient to stick to a sim- 
ple approximation |31], where 



and r(/i) is the solution of 



1 1 

TT ^ - T(/X) ■ 



r(A^) 



-y- 



kP{k) 



(k) ^ kn + i€-{k- i)r(/i) 



(13) 



(14) 



with e ^ 0+. The key features are that: 
• the integral 

dfi 



R{{k)) 



M 



p(m) 



for Erdos-Renyi graphs, is a function of the average de- 
gree (k) alone. 

• The function R{c) decreases monotonically and it di- 
verges as c ^ 1+, when the giant component vanishes 

This allows us to estimate the probabihty 

P{\h, -hj\<d}:^ 6{{k) - l)erf \d/^/2v^R{{k))\ (15) 

where the 6 function implies that this probability vanishes for 
disconnected graphs. Equating the link formation and removal 
rate, finally provides an equation for (fc) which reads 



A(fc)=,+ (i_,)0((fc)_l)erf 



d/ ^j2vAR{{k)) 



(16) 



Fig. 3 reports the numerical solution of this equation for the 
same parameters as in the simulations. This agreement is rea- 
sonably good in view of the approximations made. Again the 
mean field approach overestimates the size of the coexistence 



region. The mean-field calculation reproduces the main qual- 
itative behavior, even though it (again) overestimates the size 
of the coexistence region. 

The emergence of features (a)-(c) depends crucially on the 
divergence of R{{k)) on the average degree when (fc) w 
1. This divergence gets smoothed when v decreases, which 
suggests that the discontinuous transition should turn into a 
smooth crossover beyond a critical value Vc. This scenario, 
which is reminiscent of the behavior at the liquid-gas phase 
transition, is indeed confirmed by numerical simulations. 



VI. COORDINATING IN A CHANGING WORLD 

We now consider a further specialisation of the general 
framework where link formation requires some form of co- 
ordination, synchronisation, or compatibility. For example, a 
profitable interaction may fail to occur if the two parties do 
not agree on where and when to meet, or if they do not speak 
the same languages, and/or adopt compatible technologies and 
standards. In addition, it may well be that shared social norms 
and codes enhance trust and thus are largely needed for fruit- 
ful interaction. 

To account for these considerations, we endow each agent 
with an attribute Xi which may take one of q different values, 
Xj e {1, 2, . . . , g}. Xi describes the internal state of the agent, 
specifying e.g. its technological standard, language, or the so- 
cial norms she adopts. The formation of a new link ij requires 
that i and j display the same attribute, i.e. Xi — Xj. This is a 
particularisation of the general Eq. with djj = I — S^^^x 
and < d < 1. For simplicity we set e = since in the 
present formulation there is always a finite probability that 
two nodes display the same attribute and hence can link. We 
assume each agent revises its attribute at rate i/, choosing Xi 
dependent on its neighbours' XjS according to: 



P{xi{t) = x} = — exp 



f3 J2 ^- 



,x,{t) 



(17) 



where (3 tunes the tendency of agents to conform with their 
neighbours and Z provides the normalisation. This adjust- 
ment rule coincides with the Kawasaki dynamics, which is 
known to sample the equilibrium distribution of the Potts 
model of statistical physics |32| with temperature T = 
l/(fc_B/3). Eq. (I17> has been used extensively, mainly for 
q = 2, in the socio-economic literature as a discrete choice 
model SHU!. 

This model describes a situation where agents are engaged 
in bilateral interactions which however require a degree of co- 
ordination among partners (i.e. Xi = Xj). The agents attempt 
to improve their situation both by coordinating their value of 
Xi with that of neighbours and by searching for neighbours in 
their same state and linking with them. Link removal mod- 
els decay of links, e.g. due to obsolescence considerations. 
The stochastic nature of the choice rule ( I17t captures a de- 
gree of volatility or un-modelled features on which the inter- 
action depends (e.g. one might think that agent i might have 



some advantage for choosing a given value of Xi at a particular 
time). From the point of view of statistical physics, the pres- 
ence of a non-zero "temperature" prevents the system from 
getting stuck in imperfect states. We will consider these ef- 
fects in more detail below (see section IVIll and Fig. |8} when 
discussing the case /3 ^ cxd in greater detail. 

This is another manifestation of our general idea that 
network-mediated contact favors inter-node similarity. As in 
Section [V] we focus on the case where such a similarity- 
enhancing dynamics proceeds at a much faster rate than the 
network dynamics. That is, i/ :^ ri,X so that, at any given t 
where the network g{t) is about to change, the attribute dy- 
namics on the Xi have relaxed to a stationary state. The statis- 
tics of this state are those of the Potts model on the graph g{t). 
For random graphs of specified degree distribution p(k), the 
necessary statistics of the Potts model can be found |36, 37] 
and this makes an analytic approach to this model possible. 
We shall first discuss an approximate theory to the general 
case and then focus on a particular limit where the model can 
be solved exactly. 




FIG. 4: Results from the analytic solution. Upper plot: Plot of the 
degree distribution, p{k) for rj/X — 4, showing also the Poisson 
distribution with the same (fc) for comparison (dashed lines). Note 
that the degree distribution is not Poissonian. Lower plot: Plot of the 
average magnetisation of a node of degree k for ri/X = 4), showing 
that more highly connected nodes are, on average, more magnetised. 
q = 10. 



A. Method of Solution 

Again we rely on the random graph approximation where 
the network is completely specified by the degree distribution 
p{k). Now however the probability of two nodes being in the 
same state if they are both in the giant component depends on 
the magnetisation of the giant component. The master equa- 
tion for a general node of degree k is, 

p{k) = A(fc + l)p{k + 1) + 2r]p{k - l)n{k - 1) 

-Xkp{k) - 2?7p(fc)7r(fc) (18) 

where 7r(fc) is the probability that a node of degree k is in the 
same state as a randomly chosen node. This crucially depends 
on whether the Potts spins are ordered or not. Indeed, for suf- 
ficiently high f3 the equivalence between the different q spin 
states is broken in the stationary state of the Potts model with 
temperature l/(fcB/3). This is signalled by a non-zero value 
of the magnetization 



q{Sx,i) - 1 



1 



(19) 



where the average is both on the nodes of the giant component 
and on the stationary distribution. Without loss of generality, 
we can assume that a; = 1 is the state which is selected and 
TO > implies {Sx,i) > l/q- Then it is easy to see that 



7r(fc) 



q-l 



7(1 — u ')ni{k)m 



(20) 



where 7, u are defined in Eqs. (I2I4> and ra(k) ~ {q{6x.i\k) — 
l)/{q — 1). Note that 7r(fc) and m{k) depend on k, 
more highly connected nodes being on average more coor- 
dinated/magnetised (see figure|4}. 

With these equations we can find p{k) and TT{k) iteratively. 
Starting from a given p{k), we first compute the properties of 
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FIG. 5: Upper plot: Plots of average degree ((fc)) as a function of 
'temperature' T = 1//3. Lower plot: Plots of the probability that two 
randomly chosen nodes are in the same state (vr), as a function 1//9. 
All plots are for: ri/X = 4 (lower curves) and -q/X — 10 (higher 
curves). Points are results of simulations, q — 10. for n — 1000. 



the Potts model on a random network with such a degree dis- 
tribution, from which we get TT{k) in Eq. ( I20t . This with Eq. 
( list in the stationary state (p(fc) — 0) allows us to estimate 
Xk = p{k)/p{0) from the equation 



Xk+l 



[2riTT(k) + Xk]xk — 2777r(fc —l)xk- 
A(fc+1) 



iteratively, in terms of xq = 1 and xi — 2?77r(0)/A. Nor- 
malization, yields a new estimate of the degree distribution 
p{k) ~ Xk/ J2h ^h- We repeat this cycle until a stable solu- 
tion p{k) is found. 

Figure |5l shows (k) and n plotted against temperature for 
both simulations and theory. Note that the agreement is excel- 
lent despite the approximation made. As before, for high rj/X 
there is a highly connected network with a giant component. 
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FIG. 6: Phase diagram in r; (A = 1) and T, showing the high and 
low connectivity phases and the hysteretic region. Crosses are sim- 
ulations (n = 5000, 10000), plusses are theory, the points are in 
pairs, one on each side of the phase line. The curves are theory, the 
leftmost curve is rjc = (g/2)(exp(l/r) +q- l)/(exp(l/T) - 1) 
which is the expected transition line if the system gets stuck in the 
metastable, unmagnetised state. The right curve is found using the 
normal method described above. 
Upper left: higher connectivity, coordinated region. 
Lower right: lower connectivity, uncoordinated region. 
The central region is the hysteretic region. 



for low 77/A the network is sparsely connected. For interme- 
diate values of rj/X the two states coexist and which one is 
found depends on the initial conditions for p{k). 

Figure |6l shows a phase diagram (simulations and theory) 
for f3 and r//A. It can be seen that the theory is rather close 
to the simulation results. The low uncoordinated region is a 
Poisson random graph with (fc) — 2ri/{Xq). Starting in this 
state, the transition to the highly connected, magnetised state, 
can only occur when there is a giant component, i.e. (k) > 1 
so 77 > Ag/2. Hence the lowest point of the high connec- 
tivity region of figure |6l is at r; = 5 and T = 0. Although 
h' :^ ri,X, the simulations of the Potts model can still get into a 
metastable unmagnetised state, even below the transition tem- 
perature 11231. The temperature T at which this metastable 
state becomes unstable is given by 13711 



exp(l/T) 



(fc^) + (g-2)(fc) 
(fc2) - 2 (fc) ■ 



(21) 



The (uncoordinated) graph is Poissonian, (/c^) = (k) + (k). 
Thus the transition curve is 



Vc 



q exp(l/r) + g - 1 
2 exp(l/T) - 1 



(22) 



Monte-Carlo simulations show that this theoretical line is 
slowly approached as n is increased. 

VII. EXACT SOLUTION FOR T = 0. 



tween nodes that have the same spin, Ui = sigmuj. The key 
intuition is that the spin of site i can change only if ki = 0, i.e. 
if the site is isolated. Hence we can classify sites in disjoint 

subsets N = No U^^i N^ where 



No ^{i:h = 0} 
N^ ^ {i : ki> 0, ai 



<^} 



l,---,q (23) 



The spins cji are frozen for all nodes i G N^y with cr > 
whereas nodes in iVo have spin which are randomly updated at 
a fast rate. Because e = 0, links can only be formed between 
nodes i and j which are either both in the same component 
No- with (T > 0, or both in iVo provided they have the same 
spin, or if one is in N^ and one is in iVo, but has spin aj = a. 
No link can be formed between i E N^ and j S N^r' with 
cr, cr' > 0. 

When a link with a node in iVo is formed, one or two nodes 
pass from A^o to some N„. Likewise nodes of N„ which lose 
their links move to Aq. Such a dynamics, in the limit n — > 00 
is captured by the following evolution for the fraction n^ of 
nodes in set N„ (a > 0) 



q 



-non a 



2»/ 2 
^-0 



Xpa,ina, cr > 1. 



(24) 



Here pcr,fc is the degree distribution of nodes in N^, and 



»o = 1 - y^ : 



(25) 



CT=1 



is fixed by the normalization. The first term in Eq. J24> arises 
from the process where a node of degree zero joins a node of 
type a. The factor 2 is present because either node might have 
initiated the link. The second term is the process where a node 
of degree zero joins another node of degree zero. The factor 
2 is present because n^ increases by 2. The dynamics of the 
degree inside a component is just that leading to a random 
Poissonian graph for ki > 0. Hence pcr,k is given by 



Pa,k 



{e"' -l)/c!' 
where the average degree 

277710 2r]n 



fc> 1 



Xq 



X 



(26) 



(27) 



is obtained by balancing the average link creation rate 
2ri{ncr + no/q) with the link removal rate Xca, inside the 
component A^. The equations above allow we to recast the 
dynamics in the form 



X'c„ 



2r][l - cxp(-Ccr)] 



Xq 



Caexp(-Ccr] 



(28) 



From this it is clear that in the stationary states Ca obeys; 



The model with /3 ^ 00 and e = can be described ex- 
actly. We assume that in the initial state, links exist only be- 



Ca exp(-Co 



2??nQ 
Xq 



(29) 



for cr = 1 to g. Moreover Eq. J27> implies with the constraint 



^c, = 



(T = l 



A ■ 



(30) 



In order to build a solution, let us notice that Eq. J29> has 
two solutions (provided that 2rinQ < Xqe~^) which we denote 
c+ > 1 and c_ < 1. Hence solutions can be specified in terms 
of the number £ of components with c^ = c+. Then Eq. (I30> 
becomes 

2r? 



ic+ + {q- i)c^ = 



A 



(31) 



Equations ( I29II3U can be solved for any value of rj/X and £. 
The degree distribution for the whole network is given by: 



P[k) = — 
9 



't + <'-')t 



The average degree can be written as 

£ci + iq- £y_ 



(fc) 



(32) 



(33) 



£c+ + {q- £)c- 

We will now show that only the solutions with £ — and 1 
are dynamically stable. These are those which describe the 
behavior of the model. 

A. Stability Analysis 

The dynamics can be written as h^ — f{no,na-)- Let 
Tier — fia + e<T where n^ is the solution derived above (i.e. 
/(t7,(j, uq) = 0) with Cc = c+ for a < £ and Ca ~ c_ for 
a > £. Here ea is a small perturbation which, to leading or- 
der, satisfies 



— A y ^ Jo-.i/fi/ 



where T has matrix elements 



T — 



C„{Ca - 1) 

exp(ccr) - 1 



Sn.u 



exp(ccr) - 1 



(34) 



(35) 



A solution is stable if all eigenvalues of T are negative. For 
the £ = solution, Ca = C- = 2r}/{\q) for all a, we find 
g — 1 "transverse" eigenmodes (^^ Ca = 0) with eigenvalue 
/i = — (1 — c_)/(e'^- —1) and one "longitudinal" mode (ecr = 
e) with /i — — c_/(l — e^"^" ). Both are stable {ji < 0) so the 
£ = solution is always stable, as long as c_ < 1, i.e. for 
277 < Xq (see Eq.lSTl. 

It is also easy to find an unstable mode for solutions with 
£ > 2 components in the c+ state. Let c^ = c+ for a < £ and 
C(j — c_ otherwise and consider "transverse" perturbations 
with Ecr — for a > £ and ^^ e^ = 0. These describe den- 
sity fluctuations among c+ components. We find e^ — Xjie 
with fj, = c+(c+ — l)/(e'^+ — 1) > 0. This means that any per- 
turbation of £ > 1 solutions with an imbalance between two 
or more components with c^r = c+ will grow exponentially, 
thus leading to the collapse of all but one of the components. 
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T\IX 



FIG. 7: Plots of the mean connectivity {k) as a function of 77/A. 
Lines are theory, crosses are simulations, n = 10000, run to f = 100 
for equilibration, then to f = 200 for data taking. For n=10000 
the q=10 low state is unstable below the predicted value of 5 due 
to fluctuations being significant for finite n. The crosses that do not 
lie on the theory curves are systems which 'jumped' during the data 
taking. 



B. The ^ = 1 solution 

Combining equations J29II3U . the equation for c_ with £ = 
1 can be written as 



277/A - qc- 
c- ^-77 7 = 0. 



(36) 



This equation has no solution for 77 < j]c, where 77c is the 
point where the maximum of the l.h.s. of Eq. ( I36> as a func- 
tion of c_, becomes zero. Beyond this point (77 > 7/c) two 
solutions appear. The one with larger value of c_ merges with 
the £ = solution, as c_ ^ 1 when rj -^ Xq/2. This solution 
is un-physical as it describes a network where c+, and hence 
the average degree, decreases with the networking effort rj (or 
with decreasing volatility A). Indeed, a detailed analysis of 
the linear stability reveals the presence of an unstable mode 

a. 

The lower branch instead has c_ -^ Q as rj ^ 00 and it 
describes a physical solution with average degree increasing 
with 7;/A. Numerical analysis of the stabihty matrix shows 
that this branch is indeed dynamically stable. 

The critical point ric{q) at which the £ = 1 solution con- 
verges to A when q ^ 2, which is the point where the £ ^ Q 
solution ceases to exist. So the transition is continuous for 
(7 = 2 and there is only one branch. For q = 10 we find 
rjc — 2.27989. ..A and for large q we find rjc oc \ogq. 

In summary, the system has either one or two stable states, 
depending on the values of q and 77/A. For i] < rjc only the 
solution £ = is stable, for rj > Xq/2 only the solution £ = 1 
is stable. Finally in the interval rjc < rj < Xq/2 there are two 
stable solutions. The coexistence region [i]c, Xq/2] shrinks to 
a single point when q = 2 and it gets larger as q increases. 

Figure0shows plots of (fc) against 77/A for q — 10 for both 
simulations and the theory described here. 
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FIG. 8: Plots of mean connectivity k against time for networks start- 
ing in an initially unconnected state with n — 5000 and g = 10 and 
A = 1. The curve on the top right is 77 = 20 and the other is 7; = 10. 
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might be rational for agents to resort to a stochastic choice 
rule. The ability to find an optimal state more quickly is also 
of advantage if there are external (exogenous) shocks which 
occasionally perturb the system. 



C. Discussion 

The T = case is of particular interest because it can be 
solved exactly. For the other models described here, the co- 
ordination/correlation of the nodes were too complex for us 
to analyse exactly. In this section we have exactly solved a 
non-trivial network model. This was possible because of the 
fact that an agent only changes its spin if it has degree zero. 
The network is found to be the sum of q Poissonian random 
graphs 1.3 9il . 



Vni. CONCLUSION 



Concerning the degree distribution, it is worth noticing that 
the £ = solution is characterized by a trivial Poisson dis- 
tribution for the whole random graph. Since (k) = c_ < 1, 
there is no giant component and the system is composed of 
many disconnected components of few nodes. The solution 
with £ = 1 is however non-trivial. In this case we have a net- 
work whose p{k) is the sum of two Poisson distributions, one 
of which has c+ > 1 and thus a giant component, whilst the 
other (consisting of q — 1 separate networks plus the fc = 
nodes) has c_ < 1 and thus has no giant component. 

Even if ^ > 1 states are unstable, they may occur in the 
early stages of the stochastic evolution of the network. Figure 
|8]shows time-series plots of simulations for relatively large rj 
(rj — 10, 20), starting in an initially unconnected state. Al- 
though the T] — 10 solution eventually reaches its expected 
value of (fc) ^ 20, its approach to that value is not smooth 
as might have been expected. Rather we find that the net- 
work spends some time in an intermediate £ > 1 metastable 
state. As we move to 77 = 20, the time spent in metastable 
states increases substantially, failing to reach the stationary 
£ — 1 state (where (fc) ^ 40) despite the relatively long simu- 
lation time. The reason for this behavior is that initially more 
than one giant component forms, with different values of a. 
This state persists for a typical time tmcta which is inversely 
proportional to the eigenvalue of the unstable mode. Hence 
^mcta ~ 1/a* ^ e'^+r]/{\c?j^) becomes very long when c+ is 
large. The reason why the dynamics is so slow depends on 
the fact that in order for nodes to migrate from one compo- 
nent a to another one, they have to loose all their links. Such 
a process is limited by the density p^.i of nodes in compo- 
nent a with fc = 1, which is very small when c+ is large 

In such a situation introducing a stochastic element in 
agents' choice (i.e. switching on T > 0) or allowing for the 
formation of uncoordinated links (i.e. e > 0) would make the 
system converge very fast to the coordinated state. In other 
words, this is a case where a finite "temperature" may allow 
the agents to find the global optimum more quickly and it 



In this paper we have proposed a general theoretical setup to 
study the dynamics of a social network that is flexible enough 
to admit a wide variety of particular specifications. We have 
studied three such specifications, each illustrating a distinct 
way in which the network dynamics may interact with the 
adjustment of node attributes. In all these cases, network 
evolution displays the three features (sharp transitions, re- 
silience, and equilibrium co-existence) that empirical research 
has found to be common to many social-network phenomena. 
Our analysis indicates that these features arise as a conse- 
quence of the cumulative self-reinforcing effects induced by 
the interplay of two complementary considerations. On the 
one hand, there is the subprocess by which agent similarity 
is enhanced across linked (or close-by) agents. On the other 
hand, there is the fact that the formation of new links is much 
easier between similar agents. When such a feedback process 
is triggered, it provides a powerful mechanism that effectively 
offsets the link decay induced by volatility. 

The similarity-based forces driving the dynamics of the 
model are at work in many socio-economic environments. 
Thus, even though fruitful economic interaction often requires 
that the agents involved display some "complementary diver- 
sity" in certain dimensions (e.g. buyers and sellers), a key 
prerequisite is also that agents can coordinate in a number of 
other dimensions (e.g. technological standards or trading con- 
ventions). Analogous considerations arise as well in the evo- 
lution of many other social phenomena (e.g. the burst of social 
pathologies discussed above) that, unlike what is claimed e.g. 
by Crane |7], can hardly be understood as a process of epi- 
demic contagion on a given network. It is by now well under- 
stood 1I40II41I1 that such epidemic processes do not match the 
phenomenology reported in empirical research. Our model 
suggests that a satisfactory account of these phenomena must 
aim at integrating both the dynamics on the network with that 
of the network itself as part of a genuinely co-evolutionary 
process. 

One common feature of all the models discussed in this pa- 
per is that stable states can have either a single giant compo- 
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nent or none. Many real situations are characterized by stable 
states with a multitude of distinct components, barely con- 
nected. One example is the polarization of opinion (e.g. in 
politics) where the tendency of individuals to have opinions 
similar to those of the peers they interact with may also lead 
to the segregation of the population in different communities, 
of like mined individuals. The results presented here suggest 
that there must be a specific mechanism which is responsi- 
ble for such a polarization. We hope that future work in this 



direction may shed some light on this issue. 
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